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Memory Device and Method for Redundancy/Self-Repair 
Background 

Volatile and non-volatile memory arrays can have yield degradations due to 
problems occurring during manufacture of the memory array as well as during use of the 
memory array in the field. To improve manufacture-related yield in some memory arrays, 
the memory array can be tested at the factory, and redundancy circuits in the memory 
device can replace faulty blocks of memory (e.g., faulty bits, bytes, columns, rows, or 
sub-arrays) prior to shipment. To improve field-related yield, parity checking or error- 
correcting code ("ECC") algorithms can be used. The use of these algorithms is 
particularly important for once-programmable non- volatile memories (e.g., PROM) 
because individual memory cells cannot be programmed, tested, and then erased prior to 
shipment. Commonly-used ECC algorithms can correct single-bit errors and detect (but 
not correct) multi-bit errors. More-powerful ECC algorithms can be used to correct 
multi-bit errors, but the overhead of the circuitry and extra syndrome bits associated with 
those more-powerful algorithms may be undesirable. 

Summary 

The present invention is defined by the following claims, and nothing in this 
section should be taken as a limitation on those claims. 

By way of introduction, the preferred embodiments described below provide a 
memory device and method for redundancy/self-repair. In one preferred embodiment, a 
memory device is provided comprising a primary block of memory cells and a redundant 
block of memory cells. In response to an error in writing to the primary block, a flag is 
stored in a set of memory cells allocated to the primary block, and the redundant block is 
written into. In another preferred embodiment, an error in writing to a primary block is 
detected while an attempt is made to write to that block. In response to the error, the 
redundant block is written into. In yet another preferred embodiment, a memory device is 
provided comprising a three-dimensional memory array and redundancy circuitry. In still 
another preferred embodiment, a method for testing a memory array is provided. Other 
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preferred embodiments are provided, and each of the preferred embodiments described 
herein can be used alone or in combination with one another. 

The preferred embodiments will now be described with reference to the attached 
drawings. 

Brief Description of the Drawings 

Figure 1 is an illustration of a preferred embodiment in which a plurality of 
primary blocks of memory cells is associated with a single redundant block of memory 
cells. 

Figure 2 is a block diagram of a memory device of a preferred embodiment. 

Figure 3 is an illustration of two sets of primary and redundant blocks used to 
illustrate various mapping algorithms of a preferred embodiment. 

Figure 4 is a block diagram of a memory device of another preferred embodiment 
in which a redundancy address matching circuit is used. 

Figure 5 is a block diagram of a memory device coupled with a host device of a 
preferred embodiment. 

Figure 6 is an illustration of a memory array of the memory device of Figure 5. 

Figure 7 is an illustration of a memory array of a preferred embodiment in which a 
page of memory is distributed among bays in a stripe. 

Figure 8 is an illustration of a page sub-register of a preferred embodiment. 

Figure 9 is an illustration of a sub-array of a preferred embodiment. 

Figure 10 is an illustration of a page sub-register of a preferred embodiment that is 
extended to hold a section of spare array data. 

Figure 1 1 is an illustration showing the location of spare array data in the 
extended sections of nine sub-registers of a preferred embodiment. 

Figure 12 is a flow chart illustrating a read operation of a preferred embodiment. 

Figure 13 is a flow chart illustrating a write operation of a preferred embodiment. 

Figure 14 is a flow chart illustrating a write operation of another preferred 
embodiment. 
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Detailed Description of the Presently 
Preferred Embodiments 

By way of overview, the preferred embodiments described herein relate to 
redundancy/self-repair operations for a memory array. In at least some of the preferred 
embodiments, one or more redundant blocks of memory cells are associated with a group 
(or "bin") of one or more primary blocks of memory cells. If there is an error in writing 
to a primary block in the bin, a redundant block is used to replace that primary block. As 
used herein, the term "block" refers to an allocation unit of memory and can be any size 
ranging from a single memory cell to all of the memory cells in a memory array. A block 
can be, for example, a row of memory cells, a column of memory cells, a page of memory 
cells, or a sub-array of memory cells. 

In one preferred embodiment, each block has a set of memory cells allocated to it, 
which is used in the redundancy/self-repair operation. This embodiment will now be 
described in reference to Figure 1. As shown in Figure 1, a bin 10 of primary blocks 
(blocks 1-N) is associated with a single redundant block (redundant block 20), and each 
block has a set of memory cells 30 allocated to it. As used herein, a "set" can include one 
or more than one member. In operation, if there is an error in writing to a primary block, 
a flag is stored in the set of memory cells 30 allocated to that primary block, and the 
redundant block 20 is used to replace that block. During a read operation, the set of 
memory cells allocated to a primary block is read. If a flag is stored in the set of memory 
cells 30 allocated to that block (indicating that the redundant block replaced the primary 
block in a previous write operation), the redundant block 20 is read. 

As used herein, the term "flag" broadly refers to any suitable mechanism that can 
be used to indicate that an error occurred in writing to a block. A flag can comprise one 
bit or more than one bit. Depending on the convention used, a flag can be stored by 
action or by omission. For example, consider the situation in which the set of memory 
cells 30 is a single write-once memory cell with an initial, un-programmed state of Logic 
1. If Logic 0 is designated as indicating an error, the act of storing the flag comprises 
switching the initial state of the memory cell from Logic 1 to Logic 0. If, however, Logic 
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1 is designated as indicating an error, the act of storing the flag comprises allowing the 
state of the memory cell to remain Logic 1 . 

In one preferred embodiment, the primary block comprises a plurality of smaller 
blocks, and the error in writing to the primary block occurs when there is an error in 
writing at least one bit in one of the smaller blocks. In a presently preferred embodiment, 
a primary block is a page of 512 bytes of data bits, and the smallest- writable unit is eight 
bytes of data bits (an "oct-byte"). If there is an error in writing an oct-byte to the memory 
array, the entire page containing that oct-byte is replaced by a redundant page. Of course, 
the size of a block and the boundaries across which errors are detected can be varied and 
are likely to be driven primarily by factors such as desired design simplicity/complexity, 
fault coverage, and overhead. For example, designing a block to be an entire page 
requires relatively little overhead to store the flag. However, in the illustration set forth 
above, replacing an entire page if there is an error in writing an oct-byte of the page can 
quickly fill the redundancy areas of the memory array and, therefore, may limit the total 
number of faults that can be repaired. Designing a block to be an individual oct-byte can 
reduce this problem at the price of increased overhead since increasing the number of 
blocks increases the number of sets of bits needed to store flags. 

An error in a primary block can be defined as any type of error. For example, an 
error can be defined as occurring if a bit cannot be written as intended in one or more 
memory cells of a block. This definition of an error may be preferred if the block is not 
protected by an error protection scheme. However, if the block is protected by an error 
protection scheme that corrects single-bit errors but does not correct double-bit errors, an 
error is preferably defined as occurring if bits cannot be written as intended in two 
memory cells. More generally, an error can be defined as occurring if bits cannot be 
written as intended in X memory cells, where X is sufficient to introduce an error that is 
uncorrectable by an error protection scheme protecting at least some of the memory cells 
of a block. In the presently preferred embodiment, an ECC scheme is used to correct only 
single-bit errors in an oct-byte. An error occurs in writing to a page (the primary block in 
the presently preferred embodiment) when a double-bit error occurs in writing an oct-byte 
of that page. 
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Turning again to the drawings, Figure 2 is an illustration of a memory device 40 
of a preferred embodiment that can be used to implement the redundancy/self-repair 
operation described above. The memory device 40 comprises a memory array 50, which 
comprises a primary memory array 52 and a redundant memory array 54. The primary 
memory array 52 comprises primary blocks of memory cells, and the redundant memory 
array 54 comprises redundant blocks of memory cells. While the primary memory array 
52 is shown separated from the redundant memory array 54, the primary and redundant 
memory arrays 52, 54 can be integrated in the memory array 50. Additionally, instead of 
being part of the same memory array 50, the primary and redundant memory arrays 52, 54 
can be separated into two or more memory arrays. It should also be noted that a 
redundant block does not necessarily need to be used exclusively for redundancy. For 
example, a block of memory cells can be designated as a redundant block for one write 
operation and, if not used during that write operation, later be used as a primary block for 
a different write operation. Likewise, a block of memory cells can be designated as a 
primary block for one write operation and be used as a redundant block for another write 
operation. The memory device 40 also comprises read/write control 60, an exception 
handler 70, a row decoder 80, and a redundant row decoder 90. 

In operation, the read/write control 60 attempts to write data in the primary 
memory array 52 at a given address. As used herein, the term "data" refers broadly to 
information that is to be stored in the memory array 50 and does not necessarily refer to 
data bits {e.g. , "data" can include syndrome bits). In response to an error occurring in 
writing to one of the primary blocks, the read/write control 60 stores a flag in the set of 
memory cells allocated to that primary block and provides a "fail" signal to the exception 
handler 70. As used herein, an action taken "in response to" a condition can occur 
immediate after the occurrence of that condition or at some time after the occurrence of 
that condition. In response to the "fail" signal, the exception handler 70 generates a "sel 
redundant rows" signal, which disables the row decoder 80 and enables the redundant 
row decoder 90. The redundant row decoder 90 stores the mapping between the primary 
blocks and the redundant blocks and selects the address of the redundant block that is 



6 



associated with the failed primary block. The read/write control 60 then stores the data in 
the redundant block specified by that redundant address. 

When an error occurs in writing to a primary block, the read/write control 60 can 
cease writing bits to the primary block. For example, when an error occurs, the read/write 
control 60 can cease writing bits to the primary block, transfer all previously-written bits 
to the redundant block, and then write the remaining bits in the redundant block. 
Alternatively, the read/write control 60 can allow the write operation to proceed in the 
primary block. When the write operation is complete, the data is written to the redundant 
block. The data written to the redundant block can be read from a register providing the 
data to the memory array 50 or from the valid memory cells in the primary block. 

When a command is received to read the failed primary block, the read/write 
control 60 reads the set of memory cells allocated to the failed primary block and detects 
that a flag is stored therein. The read/write control 60 then provides a "fail" signal to the 
exception handler 70, which generates a "sel redundant rows" signal to disable the row 
decoder 80 and enable the redundant row decoder 90. The redundant row decoder 90 
selects the redundant block associated with the failed primary block, and the read/write 
control 60 reads the data stored in the redundant block. It should be noted that, in 
operation, some or all of the memory cells in the failed primary block may be read along 
with the set of memory cells allocated to the primary block, depending on how many 
memory cells are read in a given read operation and when the set of memory cells is read 
with respect to other memory cells in the block. To increase read bandwidth, the memory 
device 40 can be provided with extra sense amplifier(s) to read the set of memory cells in 
parallel with some or all of the memory cells in a primary block. 

As described above, the mapping between the primary blocks and the redundant 
blocks is stored in the redundant row decoder 90. Any suitable mapping technique can be 
used. For example, borrowed from cache memory terminology, direct mapping, set- 
associative mapping, and fully-associative mapping algorithms can be implemented. 
Figure 3 is an illustration of two sets of primary and redundant blocks that will be used to 
illustrate those algorithms and their advantages and disadvantages. Set A comprises four 
primary blocks with addresses A00, A01, A 10, and Al 1 and two redundant blocks with 



7 



addresses RAO and RA1 . Set B comprises four primary blocks with addresses BOO, B01 , 
BIO, and Bl 1 and two redundant blocks with addresses RBO and RBI . It is preferred that 
physically adjacent primary blocks do not map to the same redundant block in order to 
reduce the yield impact of large defects that may cover several blocks. In this illustration, 
direct mapping is used to map based on the least significant bit in an address. Here, the 
least significant bit of 0 (A00 and A 10) maps to RAO, and the least significant bit of 1 
(A01 and Al 1) maps to RA1. With direct mapping, if A00 is replaced by RAO, A10 
cannot be replaced. 

With set-associative mapping, any of the primary blocks in a given set can map to 
any of the available redundant blocks in that set. In this illustration, A00, A01, A 10, and 
Al 1 map to RAO or RA1, and BOO, B01, BIO, and Bl 1 map to RBO or RBI. 
Accordingly, if A00 is replaced by RAO, an error in A10 can be replaced by RA1. 
However, if an error occurs in a third primary block (e.g., All), that primary block 
cannot be replaced. With fully-associative mapping, A00, A01, A10, Al 1, BOO, B01, 
BIO, and Bl 1 map to RAO, RA1, RBO, or RBI. Accordingly, up to four primary blocks 
can be replaced with this scheme. As shown by this illustration, using a direct mapping 
technique minimizes circuit overhead at the risk of reduced yield, and set- and fully- 
associative mapping can provide higher yield at the expense of more circuit overhead. 

In the preferred embodiment described above, a set of memory cells allocated to a 
primary block was read to determine whether or not that primary block was replaced by a 
redundant block. If a flag is stored in the set of memory cells, the redundant block is 
read. In an alternate embodiment, a redundancy address matching circuitry is used 
instead of a set of memory cells. This alternative will be illustrated in conjunction with 
Figure 4, which is a block diagram of a memory device 91 that uses a redundancy address 
matching circuit 92. In operation, if an error occurs in writing to one of the primary 
blocks, the read/write control 93 provides a "fail" signal to the redundancy address 
matching circuit 92. In response to the fail signal, the redundancy address matching 
circuit 92 generates a signal to disable the row decoder 94 and enable the redundant row 
decoder 95 to store the data in the redundant block. The redundancy address matching 
circuit 92 also stores the address of the failed primary block. When that address is later 
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provided to the memory device 91 during a read operation, the redundancy address 
matching circuit 92 recognizes the address and disables the row decoder 94 and enables 
the redundant row decoder 95 so that data stored in the redundant block will be read. 

In the embodiments described above, one or more components in the memory 
device were used to provide redundancy/self-repair functionality. In an alternative 
embodiment, instead of having "on-chip" support for redundancy/self-repair, a host 
device in communication with the memory device is responsible for redundancy 
operations. For example, when an error occurs in writing a primary block, the error can 
be reported to the host device, and hardware and/or software in the host device can direct 
write and read operations to the appropriate redundant block. The host device can store 
mapping information in the memory array, so that when then memory device is used with 
a different host device, that different host device will have the information necessary to 
direct write and read operations. 

The error in writing to a primary block in any of these preferred embodiments can 
be detected in any suitable manner. For example, after one or more attempts have been 
made to program a memory cell, a read operation can be performed on the memory cell to 
determine if the memory cell is in the programmed state (e.g., Logic 0 instead of Logic 
1). If the memory cell is not in the programmed state (indicating that the memory cell 
cannot be programmed), an error occurred in writing to that memory cell. As another 
example, instead of using a separate read operation to determine if the memory cell is in 
the programmed state, the programmed state of the memory cell can be sensed while 
attempting to program the memory cell. This sensing-while-programming technique is 
described in detail in "Memory Device and Method for Sensing while Programming a 
Non-Volatile Memory Cell," U.S. patent application serial no. 09/896,815, which is 
assigned to the assignee of the present application and is hereby incorporated by 
reference. As described in that application, the sensing-while-programming technique 
senses the condition of a memory cell during the write operation to that memory cell. 
Unlike approaches that use a series of write and read pulses to determine whether a 
memory cell is in a programmed state, the sensing-while-programming technique senses 
the programmed state of the memory cell while the programming pulse is being applied to 
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a wordline. Accordingly, as bits in a block are written in parallel or one-by-one, the 
success of the programming attempt is continuously monitored. Because the wordline as 
well as the unselected bitline are kept at a high voltage during sensing, there is no 
overhead in switching to read voltage conditions, as is encountered in other approaches. 

Returning to the drawings, Figure 5 shows a memory device 100 of a preferred 
embodiment coupled with a host device 200. As used herein, the term "coupled with" 
means directly coupled with or indirectly coupled with through one or more named or 
unnamed components. As shown in Figure 5, the memory device 100 comprises I/O 
buffers 1 10, a control block 120, an ECC block 130, a page register 140, a memory array 
150, a command register 160, a status register 170, and a configuration register 180. The 
command register 160 passes commands {e.g., read, write) received from the host device 
200 to the control block 120, the status register 170 holds information from the control 
block 120 regarding the status of certain operations (e.g., a failure during a write 
operation), and the configuration register 180 can be used during a test operation to 
instruct the control block 120 to disable ECC syndrome bit generation. 

The page register 140 is a buffer structure between the memory array 150 and a 
Smart Media Interface (SMI) in communication with the host device 200. Bits are read 
from and stored in the memory array 150 a page-at-a-time, and the page register 140 
holds a page of memory that is going to be stored in the memory array 1 50. The page 
register 140 also holds a page of memory that is read from the memory array 1 50. In this 
preferred embodiment, a page of memory comprises 512 bytes of main data and 64 bytes 
of ECC syndrome bits for that data. During a write operation, 512 bytes of data and 64 
bytes of ECC syndrome bits are assembled in the page register 140 before being written 
into the memory array 150. In operation, the I/O buffers 110 temporarily store the 
incoming bytes and pass them to the control block 120. The control block 120 comprises 
a register that can hold eight bytes of data (an "oct-byte"). When eight bytes are stored, 
the oct-byte is sent from the oct-byte register to the ECC block 130, which generates a 
byte of ECC syndrome bits for the oct-byte. The oct-byte and its ECC syndrome byte 
(together, the ECC word) are then stored in the page register 140. (As used herein, the 
term "ECC word" refers to a string of bits, some of which are data bits and others of 
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which are ECC syndrome bits generated from the data bits.) The page register 140 is 
populated one oct-byte at a time until it is full (i.e., until 64 ECC words are stored in the 
page register 140). The contents of the page register 140 are then transferred to the 
memory array 150 one byte at a time. (Partial page writes, where the contents of the page 
register 140 are transferred to the memory array 150 before the page register 140 is 
completely full, can also be implemented.) During a read operation, 512 bytes of main 
data and 64 bytes of ECC syndrome bits are read from the memory array 150 into the 
page register 140 and are then serially pushed out from the SMI to the host device 200. 

Returning to the drawings, Figure 6 is an illustration of the memory array 150. 
The memory array 150 comprises 88 sub-arrays (or "tiles"). As used herein, the term 
sub-array refers to a contiguous group of memory cells having contiguous word and bit 
lines generally unbroken by decoders, drivers, sense amplifiers, and input/output circuits. 
The core of the memory array 1 50 is 72 full tile elements organized in an array of eight 
rows (which will be referred to herein as "stripes") and nine columns (which will be 
referred to herein as "bays"). In this preferred embodiment, the silicon real estate under 
the sub-arrays is used for non-memory logic circuits, such as row and column decoders. 
As shown in Figure 6, the row and column decoders, which are designated by "R"s and 
"C"s, respectively, are arranged under the memory array 150 in a checkerboard style to 
provide for a high density layout. More details on this arrangement can be found in 
"Memory Device with Row and Column Decoder Circuits Arranged in a Checkerboard 
Pattern under a Plurality of Memory Arrays," U.S. patent application serial no. 
09/896,814, which is assigned to the assignee of the present invention and is hereby 
incorporated by reference. 

Each sub-array comprises 1024 rows and 1024 columns in eight layers. This 
structure comprises the primary blocks of memory cells and provides a main 1 MB 
storage element. Accordingly, the memory array 1 50 can store 64 MB of main data and 8 
MB of syndrome bits for that data. Each sub-array also comprises eight additional rows 
for redundancy (in each layer). These additional rows comprise the redundant blocks of 
memory cells. Further, each sub-array also comprises eight rows and four columns of test 
memory, half along each side of the sub-array, which contain sacrificial test bits. 
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Alongside the 72 foil sub-array elements, there are 16 smaller sub-arrays positioned in the 
columns along the two outer edges of the core array. The smaller sub-arrays are about Va 
the size of the full sub-arrays. Two sub-arrays on the same stripe are logically merged 
into a single element with 1024 rows and 512 columns. These smaller sub-arrays will be 
referred to herein as "spare arrays." The spare arrays are used to store sideband data, 
ECC bits for the sideband data, and data used for redundancy/self repair, all of which will 
be referred to herein as "spare data" (as compared to main data). The memory array 1 50 
in this preferred embodiment is byte addressable. 

In this preferred embodiment, a memory page comprises 512 bytes of main data, 
64 bytes of ECC syndrome bits for the main data, 16 bytes of spare data, two bytes of 
ECC syndrome bits for the spare data, and 1 8 bytes of redundancy/self repair and extra 
data. The main data and the main ECC are mapped into the nine main sub-arrays in a 
stripe, and the remaining information is mapped into the two minor sub-arrays in the 
stripe. Preferably, bays 0-3 and 5-8 store the 512 bytes of main data, bay 4 stores the 
ECC bits for the main data, and bay 9 stores the spare data. 

In this preferred embodiment, the main data and ECC of a page are distributed 
among the nine sub-arrays in a stripe such that one sub-array in a stripe holds l/9 th of the 
page. (The spare arrays will be discussed in more detail below.) Figure 7 illustrates this 
arrangement, with different shadings used to depict different pages stored in the nine bays 
of each stripe. For example, for ECC word "A," bits A0-A7 would be stored in stripe 0, 
bay 0; bits A8-A15 would be stored in stripe 0, bay 1; etc. Likewise, for ECC word "B", 
bits B0-B7 would be stored in stripe 0, bay 0; bits B8-B15 would be stored in stripe 0, 
bay 1 ; etc. As shown by the shadings in Figure 7, in this preferred embodiment, bits of a 
given page are located in the same location in each of the sub-arrays. To distribute a page 
of memory to the nine bays in a stripe, the page register 140 is preferably divided into 
nine identical blocks (or "sub-registers"), each of which contains page data for one of the 
nine sub-arrays in a stripe {i.e., 179 th of the page). (As described below, each sub-register 
can also contain page data for l/9 th of what belongs to the spare array.) The read/write 
controllers ("rw_ctrl") on the top of each bay act as an interface between the sub-registers 
and the bays. 
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The page register 140 in this preferred embodiment incorporates bit sprinkling 
functionality to maximize the physical spacing between bits in a given oct-byte and, 
accordingly, to improve the effectiveness of the ECC protection. Further details on the 
bit sprinkling functionality can be found in "Memory Device and Method for Storing Bits 
in Non-Adjacent Storage Locations in a Memory Array," U.S. patent application serial 

no. (attorney docket no. 10519/60; filed on the same day as the present 

application), which is assigned to the assignee of the present application and is hereby 
incorporated by reference. While control logic is responsible for physically mapping a 
page into the memory array 150, data scrambling and de-scrambling in the sub-registers is 
enabled as a hardware feature of the page register 140 and is transparent to the control 
logic. Further, the SMI logic will see the data in its original form at all times. Figure 8 is 
an illustration of the page sub-register for bay 0. One byte of each word in the page is 
stored in the page sub-register from top to bottom. For example, bits A0-A7 of ECC 
word A are stored in rows 0-7 of the first column, bits B0-B7 of ECC word B are stored 
in rows 0-7 of the second column, etc. When bits are read out of the page sub-register 
and into a corresponding sub-array, the bits are read from left to right (e.g., AO B0 CO . . . 
HO). The bits read from the page sub-register are then stored in the corresponding sub- 
array, as illustrated in Figure 9. (Addressing of the columns is taken care of by the 
column decoders in the array.) As shown in Figure 9, because the direction that the bits 
were stored in the page sub-register is different from the direction that the bits were read 
out of the page sub-register, bits in an ECC word are located in non-adjacent locations in 
the memory array. Specifically, each bit is 64 storage locations apart in the sub-array 
(e.g., AO is 64 storage locations apart from Al), thereby achieving maximum separation 
between bits of an oct-byte. Accordingly, a local manufacturing defect must be large 
enough to extend across 65 bit locations before causing a double-bit error within an ECC 
word. A defect that extends across 64 bits locations will only result in a correctable 
single-bit error in 64 ECC words, instead of an uncorrectable multi-bit error in a single 
ECC word. This enhances the yield and reliability of the memory device 100 with a 
minimal amount of overhead and implementation cost. The same principles described 
above apply to storage of the ECC syndrome bits in the sub-array in bay 4. 
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As discussed above, the spare array in Bay 9 is physically made up of two 14-sized 
sub-arrays that logically act as a single array of 1024 rows and 512 columns (excluding 
redundancy and sacrificial test areas). To enable maximum separation between bits of an 
oct-byte in the spare array, bits are sprinkled within the page register 140. To simplify the 
interface between the page register 140 and the control logic, each page sub-register 
preferably is extended to hold a section of the spare array data as shown in Figure 10. 
The addition extends the sub-register by another bit (from eight to nine) for half of its 
height. As the page register 140 is made of nine sub-registers, the spare array data will be 
contained within the extended sections of these sub-registers as shown in Figure 1 1 . In 
Figure 11, each block represents one byte of data (one bit wide and eight bits deep). SP0 
and SP1 are the spare array data (16-bytes) and ECC0 and ECC1 are their respective ECC 
syndrome bits. XY is used to store redundancy/self-repair data. R is a reserved area, 
which can be used to store dynamic bit inversion information, as described in "Memory 

Device and Method for Dynamic Bit Inversion," U.S. patent application serial no. 

(attorney docket no. 10519/64; filed on the same day as the present application), which is 
assigned to the assignee of the present application and is hereby incorporated by 
reference. XYM and RM are masked bit locations in XY and R space. 

Each row in Figure 1 1 represents data as it will be written to the memory array 
150. This scheme maximizes spacing between data bits from the same oct-byte to four 
bits. However, the actual spacing achieved in the memory array 150 is significantly 
higher as data is written into the array 150 in groups of four with a large spacing 
introduced in between. This will give much of the intended result of bit sprinkling. Data 
scrambling and de-scrambling in the sub-registers for the spare array is enabled as a 
hardware feature of the page register 140 and is transparent to the control circuitry. The 
SMI logic will see the data in its original format at all times. It is apparent that with this 
scheme, in every cycle, the page register 140 would provide a nine bit output to the spare 
memory array. In order to utilize the existing array architecture based on eight-bit R/W 
circuits, the RW_CTRL circuit servicing the spare array tiles preferably steps down the 
incoming data stream from nine bits to eight bits by multiplexing the inputs from 
SubReg4 and SubReg5 such that bits tagged as XYM and RM are omitted. Preferably, 



these bit locations are masked out by the SMI Logic so they can be ignored without 
causing any functionality problems. 

The memory device 100 of this preferred embodiment comprises a 
redundancy/self-repair mechanism that improves reliability by identifying bad pages 
during write or test operations, marking the bad pages as faulty, and then placing the 
contents of the bad pages in another location. As mentioned above, the XY locations in 
the spare arrays are used to store redundancy/self repair data. The XY location comprises 
two different locations (an XX location and a YY location), which provide an indication 
about the repair status of each page. While locations XX and YY can be implemented as 
single bits, it is preferred that those locations use multiple bits to provide protection 
against faults. It is also preferred that the bits be stored in physically different tile 
elements. In one embodiment, some of the multiple bits of XX and YY are stored in one 
half array and the remaining bits are stored in the other half array. As the two half arrays 
are physically far apart, the probability of both locations being hit by a fault is relatively 
low. An error detection and correction scheme (such as quadruple redundancy) can also 
be implemented to further improve the reliability of these locations. It is preferred that 
the redundancy/self repair data in locations XX and YY be placed in the first columns of 
the spare arrays so that faults can be identified within the first cycle of a read operation 
without having to wait for an entire page to transfer. The read and write operations used 
in conjunction with the redundancy/self-repair mechanism will now be described with 
reference to the flow charts of Figures 12-14. 

Figure 12 is a flowchart illustrating acts of a read operation that relate to the 
redundancy/self-repair mechanism. At the start of a read operation, the first column in 
the spare array of the primary block (location YY) is read (act 300). Next, the memory 
device 100 determines whether location YY is programmed (act 310). As described 
below, location YY is programmed if an error occurred during a write operation. 
Accordingly, if location YY is not programmed, the read and ECC operations continue as 
normal since there is no indication that the page is known to be bad (act 320). However, 
location YY being programmed indicates that an earlier write operation in the page failed. 
In this situation, location XX in the primary block is read to determine whether that 
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location is programmed (act 330). As described below, location XX is programmed if an 
attempted repair was reported to be a failure. If location XX is programmed, the memory 
device 100 is trying to read a page where a repair failed. In that situation, the contents of 
the original location are read, and the status register 1 70 is set to indicate a read failure 
(act 340). However, if location XX is un-programmed (indicating a successful repair), a 
repair signal is asserted (act 350). The repair signal initiates a repair cycle by causing the 
row decoders to point to a redundancy row in the memory array 150 instead of a main 
row. With the row decoders pointing to the redundancy location, the memory device 100 
continues with the read operation, reading data of the page from the redundancy section 
of the array (act 360). When the read operation is completed, the repair signal is de- 
asserted (act 370). 

A write operation will now be illustrated in conjunction with Figure 13. With 
reference to Figure 13, an attempt is first made to write a page to the memory array 150 
(act 400), and a determination is made regarding whether an error has been detected (act 
405). In this preferred embodiment, an ECC scheme that can correct a single-bit error 
protects individual oct-bytes in a page, and an error is detected when more than one bit in 
a given oct-byte cannot be written into the memory array 1 50 as intended. If the bit 
sprinkling feature is used, it is preferred that the error reporting mechanism keep track of 
errors in all tiles in parallel since each eight-bit written entity comprises bits from eight 
different oct-bytes. To support this functionality, it is preferred that a 72 x 8 memory 
array be used, where a group of eight rows (out of the 72 rows) represents one of the bays, 
where each row in the group of eight rows represents an oct-byte boundary for the tile, 
and where there is a column for each of the eight bits in the written entity. In this 
preferred embodiment, faults occurring within the spare array are not tracked with this 
scheme due to the complexity of the operation across scrambled data. Instead, a simple 
double-fault detection scheme is preferably deployed for the spare array, where any two 
write errors within the same spare array boundary will trigger a write fault. Such a 
detection scheme is adequate given that the spare array contains only two oct-bytes per 
page. 



If an error is detected, the write operation continues until the entire page is 
written, and then the repair signal is asserted (act 410). In this preferred embodiment, an 
error in writing two or more bits in a given oct-byte triggers the replacement of the entire 
page comprising that oct-byte. Before attempting the replacement, the memory device 
100 reads location YY in the redundant block (act 415) and determines if location YY is 
programmed (act 420). This determines whether the targeted redundancy location is 
already occupied. If the targeted redundancy location is already occupied, the page 
cannot be repaired, and the repair signal is de-asserted (act 425). Locations XX and YY 
in the original location are then programmed with 0s to indicate that a repair was 
attempted but failed (act 430). The fault is also marked in the status register 170 (act 
435). If the targeted redundancy location is not already occupied, the page is written into 
the repair row (act 440), and a 0 is written in location YY to mark the redundancy 
location as being occupied (act 445). Next, it is determined whether a fault occurred in 
writing the page to the repair row (act 450). If a fault was detected, the repair failed, and 
acts 425-435 are performed as described above. However, if a fault was not detected, the 
repair was a success, and the repair signal is de-asserted (act 455). A 0 is written into 
location YY of the primary block to indicate that the repair was attempted and completed, 
and the write operation ends (act 465). 

As mentioned above, the memory device 100 can provide partial page write 
functionality. During a partial page write, an entire page is written to memory, but the 
unused sections of the page are padded with Is. In this way, one can write, for example, 
the first 60% of a full page with one page write command and then write the remaining 
40% of the page with a second page write command. In the presently preferred 
implementation, when a page is written, no indication is provided regarding whether the 
page is a partial page write or a full page write. Accordingly, every write operation is 
preferably treated as a partial page write. The redundancy/self-repair feature, however, 
preferably recognizes that a fault occurring during a partial page write will require 
transferring the entire page to the redundancy area. To accomplish this, the following two 
features are preferably added to the method illustrated in Figure 13. First, before any 
partial page write, locations YY and XX are read to see if the page was repaired before. 
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If so, the write operation preferably continues in the redundancy area. If not, the 
operation preferably operates as a regular write. Second, if there is a fault during the 
partial page write, the existing page information is preferably captured first and then 
written back into the redundancy area along with the original data. 

Figure 14 is a flowchart illustrating a write operation with these modifications. 
Before a write operation is performed, locations XX and YY in the primary block are read 
(act 500). If XX and YY both contain 0s, the page is bad and cannot be replaced. The 
write operation is completed in the original location, the error is reported in the status 
register 170, and the write operation is aborted (act 505). If YY contains a 0 and XX 
contains a 1 , the page is already in the redundancy area, and the write operation should be 
performed in that space. In this situation, the repair signal is asserted (act 510), and the 
full page is written into the repair row (act 515). If no fault occurs, the write was 
successful. However, if a fault occurs, the fault is marked in the status register 170, and 
the write operation is aborted (act 525). In this preferred embodiment, a fault in the 
redundancy area cannot be fixed. In an alternate embodiment, faults in the redundancy 
area are fixable. For example, additional redundant blocks can be allocated to replace 
faulty redundant blocks. 

If XX and YY both contain Is, the page is good, and a full page write is 
performed (act 530). If no fault occurs, the write operation is a success, as is expected for 
most write operations. However, if a fault occurs, the repair signal is asserted (act 540), 
and location YY in the redundant block is read (act 550). If location YY stores a 0, a 
repair is not possible since the redundant block is already occupied. The repair signal is 
de-asserted (act 555), 0s are written in the YY and XX locations of the primary block (act 
560), and the fault is marked in the status register 170 (act 525). In this preferred 
embodiment, if a partial page is being written when a fault occurs, it is preferred that the 
entire page be moved with the previously-written sections to the spare location. 
Accordingly, if location YY stores a 1 , a repair attempt is possible, the repair signal is de- 
asserted (act 565), and the page is reconstructed (act 570). The mechanism to reconstruct 
pages relies on the following assumptions. One, all unwritten (virgin) bits in the memory 
are guaranteed to be at a Logic 1 state. Two, during a write operation, a 0 will never be 
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accidentally written into a cell that is supposed to remain virgin. Three, errors reported 
during a write operation indicate that the memory device was unable to program a 0 into 
the memory array 150. Four, during a partial-page write, the page register 140 will 
contain the data for the section of the page (some multiple of oct-bytes), and the rest of 
the page will be padded with Is. Given these assumptions, a new page can be 
reconstructed by a simple AND operation between the contents of the memory and the 
page register 140. As the ECC syndrome bits are self-contained in the existing portion of 
the page, ECC does not have to be recomputed for the newly-constructed page. 
Furthermore, the page can be reconstructed in the page register 140 itself without the 
need for additional storage space if the page register 140 has separate read and write 
ports. 

After the page is reconstructed, the repair signal is asserted (act 575), and the 
reconstructed page is written into the redundancy area (act 580). During this time, 
location YY in the redundant block is set to 0, and the fault memory is cleared. (The fault 
memory is used to transfer the contents of the current page into the redundancy area. This 
memory can be physically mapped into the existing data register at the expense of some 
control logic.) If a fault occurs in writing the reconstructed page into the redundancy 
area, the repair fails, and acts 555, 560, and 525 are performed as described above. 
However, if no faults occur, the repair was successful. The repair signal is de-asserted 
(act 590), and 0 is stored in location YY of the primary block (act 595). 

There are several advantages associated with these preferred embodiments. For 
example, the preferred embodiments can be used in non- volatile memories (write-once or 
write-many memories such as Flash, MRAM, FRAM, and organic polymer diodes) to 
improve yield due to failures that happen after the part ships as well as to reduce test 
costs. Reduction of test cost can be particularly important as the bit costs reduce in 
comparison to the test costs. Additionally, "on-the-fly" redundancy/self-repair increases 
the yield of a memory array when combined with ECC. Consider, for example, the 
situation in which the block size is one oct-byte and the target number of defects per 
million (dpm) memory arrays for a 64 MB memory is 50. By allocating eight bits of ECC 
for every 64 bits of data to protect for single-bit failures (a 12.5% overhead), the number 
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of single-bit defects that can be allowed per chip is about 28.3. However, if ECC were 
combined with on-the-fly redundancy with direct mapping of 1,000 oct-bytes to one 
redundant oct-byte, the number of defects per chip is increased to about 1,423. 
Accordingly, a factor of 50 improvement in defects per chip is achieved with an overhead 
of one bit per 72 bits (a 1 .4% overhead) and an extra oct-byte (a 0.1% overhead). 

There are several alternatives that can be used with these preferred embodiments. 
For example, in the preferred embodiments described above, the redundancy feature was 
used to provide "on-the-fly" self-repair when field programming a memory array. In an 
alternate embodiment, the redundancy feature can be used when testing the memory array 
either at the factory or in the field. The redundancy/self-repair feature can be 
implemented as part of a testing apparatus, can be combined with other test features, or 
can be used by itself as a built-in self-test ("BIST") structure. In the BIST 
implementation, the memory device can perform simple logic cycling through the address 
space of the memory array and test the test columns. Upon detection of an error, the self- 
repair feature can automatically mark the faulty block of memory cells and provide a 
redundant block in the redundancy area of the memory array. In operation, in response to 
an error in writing test bits in the primary block, a flag is stored in a set of memory cells 
allocated to that primary block. Later, when a command is received to write to the 
primary block, the set of memory cells allocated to the primary block is read. In response 
to reading the flag stored in the set of memory cells, the memory device writes to the 
redundant block. 

In another alternate embodiment, ECC bits are from different word lines, and the 
self-repair replacement block are from different bit lines. In yet another alternate 
embodiment, instead of implementing the ECC functionality as a hardware component in 
the memory device, the ECC functionality can be implemented as a hardware and/or 
software component in the host device. Additionally, the ECC functionality can be 
distributed between the memory device and the host device. Further, the preferred 
embodiments can be used without ECC or other error protection schemes. Also, the flag 
bit(s) can have a multi-bit parity coding that is suitable for write-once memories. This 
allows the parity to work even after new bits are programmed. 



These preferred embodiments can be implemented with any suitable memory 
device, such as a solid-state memory device (i.e., a memory device that responds to 
electrical read and write signals to cause digital information to be read from and stored in 
a memory array of the device), a magnetic storage device (such as a hard drive), or an 
optical storage device (such as a CD or DVD). The memory array can comprise write- 
once or write-many volatile or non-volatile memory cells. Although any suitable type of 
memory cell can be used, in one preferred embodiment, the memory cell comprises an 
anti-fuse and a diode. The memory array can be made from any suitable material. In one 
preferred embodiment, the memory array comprises a semiconductor material. Other 
materials can be used, such as, but not limited to, phase-change materials and amorphous 
solids as well as those used with MRAM and organic passive element arrays, as described 
in U.S. Patent No. 6,055,180, which is hereby incorporated by reference. 

The memory array can comprise a two-dimensional or three-dimensional memory 
array of memory cells. In one preferred embodiment, the memory device takes the form 
of a solid-state memory device having a three-dimensional memory array of vertically- 
stacked field-programmable memory cells, as described in U.S. Patent No. 6,034,882 to 
Johnson et al., U.S. Patent No. 5,835,396 to Zhang, and U.S. patent application serial no. 
09/560,626, all of which are hereby incorporated by reference. As discussed in those 
documents, three-dimensional memory arrays provide important economies in terms of 
reduced size and associated reductions in manufacturing cost. It is important to note that 
the following claims should not be read as requiring a specific type of memory device 
(e.g., write-once or write-many) or specific type of memory array (e.g., two dimensional 
or three-dimensional) unless explicitly recited therein. 

In one embodiment, a memory device comprising a three-dimensional memory 
array comprises redundancy circuitry. The term "redundancy circuitry" is intended to 
broadly refer to any structure that is operative to write to a redundant block in response to 
an error in writing to a primary block of the memory array. Redundancy circuitry can 
perform additional functions. For example, redundancy circuitry can be used in read 
operations. While redundancy circuitry can be designed to store a flag in a set of memory 
cells allocated to the primary block or as a redundancy address matching circuit, 



redundancy circuitry can take other forms and should not be limited to the redundancy 
features described herein. 

In one preferred embodiment, the memory device takes the form of a modular, 
compact, handheld unit, such as a memory card or stick, that comprises an external 
electrical connector that can be coupled with a host device, such as a digital camera, 
digital audio player, or other portable consumer product. 

It is intended that the foregoing detailed description be understood as an 
illustration of selected forms that the invention can take and not as a definition of the 
invention. It is only the following claims, including all equivalents, that are intended to 
define the scope of this invention. Finally, it should be noted that any aspect of any of the 
preferred embodiments described herein can be used alone or in combination with one 
another. 



